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Abstract Linearly bounded Turing machines have been mainly studied as 
acceptors for context-sensitive languages. We define a natural class of infi- 
nite automata representing their observable computational behavior, called 
linearly bounded graphs. These automata naturally accept the same lan- 
guages as the linearly bounded machines defining them. We present some of 
their structural properties as well as alternative characterizations in terms of 
rewriting systems and context-sensitive transductions. Finally, we compare 
these graphs to rational graphs, which are another class of automata accept- 
ing the context-sensitive languages, and prove that in the bounded-degree 
case, rational graphs are a strict sub-class of linearly bounded graphs. 



1 Introduction 

One of the cornerstones of formal language theory is the hierarchy of lan- 
guages introduced by Chomsky in [9] . It rests on the definition of four increas- 
ingly restricted classes of grammars, which respectively generate the recur- 
sively enumerable, context-sensitive, context-free and rational languages. All 
these classes were extensively studied, and have been given several alterna- 
tive characterizations using different kinds of formalisms (or acceptors). For 
instance, pushdown systems characterize context-free languages, and linearly 
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bounded Turing machines (LBMs) characterize context-sensitive languages. 
More recently, several authors have related these four classes of languages to 
classes of infinite graphs (for a survey, see for instance [27]). Given a fixed 
initial vertex and a set of final vertices, one can associate a language to a 
graph by considering the set of all words labeling a path between the initial 
vertex and one of the final vertices. In [8], a summary of four classes of graphs 
accepting the four classes of the Chomsky hierarchy is presented. They are 
the Turing graphs [6], rational graphs [20,21,24], prefix-recognizable graphs 
[4,5], and finite graphs. 

Several approaches exist to define classes of infinite graphs, among which 
we will cite three. The first one is to consider the finite acceptor of a formal 
language, and to build a graph representing the structure of its computa- 
tions: vertices represent configurations, and each edge reflects the observable 
effect of an input on the configuration. One speaks of the transition graph of 
the acceptor. An interesting consequence is that the language of the graph 
can be deduced from the language of the acceptor it was built from. A second 
method proposed in [8] is to consider the Cayley-type graphs of some classes 
of word rewriting systems. Each vertex is a normal form for a given rewriting 
system, and an edge between two vertices represents the addition of a letter 
and re-normalization by the rewriting system. Finally, a third possibility is 
to directly define the edge relations in a graph using automata or other for- 
malisms. One may speak of derivation, transduction or computation graphs. 
In this approach, a path no longer represents a run of an acceptor, but rather 
a composition of binary relations. 

Both prefix-recognizable graphs and Turing graphs have alternative def- 
initions along all three approaches. Prefix-recognizable graphs are defined 
as the graphs of recognizable prefix relations. In [25], Stirling characterized 
them as the transition graphs of pushdown systems. It was also proved that 
they coincide with the Cayley-type graphs of prefix rewriting systems. As 
for Turing graphs, Caucal showed that they can be seen indifferently as the 
transition and computation graphs of Turing machines [6]. They are also 
the Cayley-type graphs of unrestricted rewriting systems. Rational graphs, 
however, are only defined as transduction graphs (using rational transduc- 
ers) and as the Cayley-type graphs of left-overlapping rewriting systems, and 
lack a characterization as transition graphs. In this paper, we are interested 
in defining a suitable noti(m of transition graphs of linearly bounded Turing 
machines, and to determine some of their structural properties as well as to 
compare them with rational graphs. 

As in [6] for Turing machines, we first define a labeled version of LBMs, 
called LLBMs. Their transition rules are labeled either by a symbol from 
the input alphabet or by a special symbol denoting an internal, unobservable 
transition. Following an idea from [25], we consider that in every config- 
uration of a LLBM, either internal actions or inputs are allowed, but not 
both at a time. This way, we can distinguish between internal and external 
configurations. The transition graph of a LLBM is the graph whose vertices 
are external configurations, and whose edges represent an input followed by 
any finite number of silent transitions. This definition is purely structural 
and associates a unique graph to any given LLBM. For convenience, we call 
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such graphs linearly bounded graphs. A similar work was proposed in [16,23], 
where the class of configuration graphs of LBMs up to weak bisimulation is 
studied. However, it provides no formal definition associating LBMs to a class 
of real-time graphs (without edges labeled by silent transitions) representing 
their observable computations. 

To further illustrate the suitability of our notion, wc provide two alter- 
native definitions of linearly bounded graphs. First, we prove that they are 
isomorphic to the Cayley-type graphs of length- decreasing rewriting systems. 
The second alternative definition directly represents the edge relations of a 
linearly bounded graph as a certain kind of context-sensitive transductions. 
This allows us to straightforwardly deduce structural properties of linearly 
bounded graphs, like their closure under synchronized product (which was 
already known from [16]) and under restriction to a context-sensitive set of 
vertices. To conclude this study, we show that linearly bounded graphs and 
rational graphs form incomparable classes, even in the finite degree case. 
However, bounded degree rational graphs are a strict sub-class of linearly 
bounded graphs. 

2 Preliminary Definitions 

A labeled, directed and simple graph is a set G C V x Z' x ^ where S is 
a finite set of labels and V a countable set of vertices. An element (s, a, t) 
of G is an edge of source s, target t and label a, and is written s t or 

G 

simply s t if G is understood. The set of all sources and targets of a 
graph is its support Vg- Two graph G,H C V x S x V are isomorphic if 
there exists a bijection p from Vh to Vg such that for all x,y € Vh, x — > y 

mp{x) -^p{y). 

A sequence of edges s\ t\,. ..,Sk tk with Vz e [2, k], Sj = ti-\ 
is a path. It is written si — ^ t^, where u = ai . . .a^ is the corresponding 
path label. A graph is deterministic if it contains no pair of edges with the 
same source and label. One can relate a graph to a language by considering 
its path language, defined as the set of all words labeling a path between two 
given sets of vertices. 

Definition 1 The (path) language of a graph G between two sets of vertices 
/ and F is the set 

L{G,I,F) = {w\s^t, sGl, teF}. 



2.1 Linearly bounded Turing machines 

We recall the definitions of context-sensitive languages and linearly bounded 
Turing machines. A context-sensitive language is a set of words genc;ratc;d 
by a grammar whose production rules are of the form a —>■ jS with \f3\ > 
\a\. Such grammars are called context-sensitive or sometimes growing. A 
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more operational definition of context-sensitive languages is as the class of 
languages accepted by linearly bounded Turing machines (LBMs). 

Definition 2 A linearly bounded Turing machine is a tuple M = {r,IJ, [,], 

Q, qo, F, S), whore 

— r' is a finite set of tape symbols, 

— S C r is the input alphabet which docs not contain the symbol s, 

— [ and ] ^ r are the left and right end-marker, 

— Q is a finite set (disjoint from F) of control states, 

— qo & Q is the unique initial state, 

— F C Q is a set of final states, 

— 6 is a finite set of transition rules of one of the forms: 

pA — >qB± p[ — >q[+ p] — >q]- 

with p,q e Q, A,B € F and ± G {+, -}. 

As usual in the syntax of Turing machines, symbols + and — in transition 
rules respectively denote a move of the read head to the right and to the left. 
The set of configurations Cm of M is the set of words uqv such that q £ Q, 
V ^ s and uv G \F*]. The transition relation — > is a subset of Cm x Cm 

M 

defined as: 

— > = { (upAv, uBqv) I pA — > qB-\- & 5 } 

M 

U { [uCpAv, uqCBv) | pA — > qB- G 5} 

We will simply write — »■ when M is understood. For any input word w & E* , 
the unique initial configuration is [qow] and a final configuration c/ is a 
configuration containing a terminal control state. A word w is accepted by 
M if [qQw] — > Cf where c/ is a final configuration. Quite naturally, M is 
deterministic if, from any configuration, at most one rule can be applied. 
Formally, for all configurations c, c\ and C2 such that c — > c\ and c — > C2, 
then ci = C2- 

Other definitions of linearly bounded machines do not use the border 
symbols to constrain the head inside a portion of the tape whose size equals 
the size of the input. Instead, they externally require that the tape size used 
at any point during any computation be at most k times the size of the input, 
where A; is a fixed constant. It is a well-known fact that k can be considered 
equal to 1 without loss of generality, and that these definitions are equivalent 
to the one given above. An interesting open problem raised by Kuroda [17] 
concerns deterministic context-sensitive languages, which are the languages 
accepted by deterministic LBMs. It is not known whether they coincide with 
non-deterministic context-sensitive languages, as is the case for recursively 
enumerable or rational languages. 

Note that, contrary to unbounded Turing machines, it is sufficient to 
only consider linearly bounded machines which always terminate, also called 
terminating machines. This is expressed by the following proposition: 

Proposition 1 For every linearly bounded Turing machine, there exists a 
terminating linearly bounded Turing machine recognizing the same language. 
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Proof It suffices to show that, for every hnearly bounded machine M whose 
configuration graph contains a cycle, there is an equivalent terminating ma- 
chine M', i.e. a machine which always terminates and accepts the same lan- 
guage as M. The total number of distinct configurations of M during a run 
on a word of size n is bounded by k", where fc is a constant depending on the 
size of the control state set and work alphabet of M. It is easy to see that a 
word w accepted by M must be accepted by at least one run of size less than 
fcl^l. Indeed, if the smallest run accepting w was longer than this bound, it 
would necessarily contain two occurrences of the exact same configuration, 
i.e. a cycle. By removing this cycle, one would obtain a shorter accepting 
run. Let M' be the machine which simulates M while incrementing a n-digit 
counter in base k encoded in the alphabet and stops the run if the counter 
overflows. By construction, M' accepts the same language as M and is ter- 
minating. □ 

Similar arguments are used in [17] to show that the set of words on which 
a linearly bounded machine has an infinite run is context-sensitive. Another 
property of context-sensitive languages is that they are closed under union 
and complement. 

Theorem 1 ([14,26]) The context-sensitive languages over a finite alphabet 

r form an effective Boolean algebra. 

The notion of space-bounded Turing machine can be extended to any 
bound / : N i-> N such that for ah n > 0, /(n) > log(n). The set of all 
languages accepted by a Turing machine working in space f{n) on an entry 
of size n is written NSPACE[/(n)]. The following theorem states that the 
space hierarchy is strict. 

Theorem 2 ([13,14]) For all space- constructible f and g in'H such 

that lim„^+oo 4^ = 0, we have NSPACE[/(n)] C NSPACE[g(n)]. 
9{n) 

In particular, the class NSPACE[2"] of languages recognizable in expo- 
nential space strictly contains the class of context-sensitive languages (or 
NSPACE[n]). 



2.2 Rational graphs 

Consider the product monoid S* x S*, whose elements are pairs of words 
(m, f) in S* , and whose composition law is defined by (ui,ui) • (^2,^2) = 
{uiU2,v\V2)- A finite transducer is an automaton over S* x S* with labels 
in {E U {e}) x (17 U {e}). Transducers accept the rational subsets of E* x E*, 
which are seen as binary relations on words and called rational transductions. 
We do not distinguish a transducer from the relation it accepts and write 
{w,w') e T if the pair {w,w') is accepted by T. Graphs whose vertices are 
words and whose edge relations are defined by transducers (one per letter in 
the label alphabet) are called rational graphs. 
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Definition 3 ([20]) A rational graph labeled by E with vertices in F* is 
given by a class of transducers {Ta)aei: over F. For all a G S, {u,a,v) € G 
if and only if (u, v) G Ta- 

For w £ and a G Z", wc recursively define Ty^a = ° To, where o 
denotes the standard relational composition, and wc write u v if and only 
if {u,v) G T-u}. In general, there is no bound on the size difference between 
input and output in a transducer (and hence between the lengths of two 
adjacent vertices in a rational graph). Interesting sub-classes are obtained by 
enforcing some form of synchronization. The most well-known was defined 
by Elgot and Mezei [11] (see also [12]) as follows. A transducer over S with 

initial state qo is (left-)synchronizcd if for every path qo qi...qn-i 
xn/vri there exists k G [0, n] such that for all i G [1,A;], Xi and yt belong 
to E and either Xk+i ■ ■ - Xn — £ and Uk+i ■ ■ - Vn € S* or yk+i ■ ■ - Vn = £ and 
Xfe+i . . . Xn & E* . A rational graph defined by synchronized transducers will 
simply be called a synchronized (rational) graph. 

Rational graphs form a class of infinite acceptors for context-sensitive 
languages. For a discussion on the expressive power of the sub-classes of 
rational graph seen as language acceptors see [3] . 

Theorem 3 ([21]) The languages accepted by the rational graphs between 
a rational set of initial vertices and a rational set of final vertices are the 
context-sensitive languages. 

This result can be slightly strengthened in the case of a single initial and 

final vertex. 

Corollary 1 For any rational graph G labeled by E, pair (i, /) of vertices 
of G, and symbol ^ E, the language Lq = {ii,w'if \ w £ L{G, {i}, {/})} is 
context-sensitive. 

Proof Let G be a rational graph labeled by E with vertices in F* and defined 
by a family of transducers {Ta)aes- Let F and F be two finite alphabets 
disjoint from but in bijection with F. For any x G F, we write x (resp. x) 
the corresponding symbol in F (resp. F). We consider the rational graph H 
labeled hy S = EUFUF defined by the family of transducers {Tx)x&e where 
for all xGF,Ts = {{u,ux) \ u € F*} and = {{xu,u) \ u G F*}. 

By Thm. 3, the language L = L{H,£,£) n r*E*r* is context-sensitive. 
By construction, L is equal to {iwf \ i,f € F* and w G L{G , {i} , {f})} . It 
follows that Lc is context-sensitive. □ 

3 Linearly Bounded Graphs 

3.1 LBM Transition Graphs 

Following [6] , we define the notion of labeled linearly bounded Turing machine 
(LLBM). This notion is very close to the notion of off-line Turing machine 
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[13]. It is essentially equivalent to an off-line Turing machine with a one- 
way input tape and a two-way linearly bounded work tape. As in standard 
definitions of LBMs, the transition rules can only move the head of the LLBM 
between the two end markers [ and ]. In addition, a silent step can decrease the 
size of the configuration (without removing the markers) and a I7-transition 
can increase the size of the configuration by one cell. This ensures that while 
reading a word of length n, the labeled LBM uses at most n cells. 

Definition 4 A labeled linearly bounded Turing machine is a tuple M = 

{r. Q, qo, F, S) as in Definition 2. where all components but S are de- 

fined similarly, and (5 is a finite set of labeled transition rules of one of the 
forms: 

pA -UqB± p[-^q[+ p] -^q]- 

pB -^qAB p] -^qA] pA -^q 

with p,q e Q, A,B e r, ± e {+, -} and aG E. 

Configurations are defined similarly to the previous case. However, the 
transition relation is now labeled. For all x G EU is}, the relation is a 

M 

subset of Cm x Cm defined as: 

^ = { iupAv,uBqv) I pA qB+ g6} 

M 

U { {uCpAv,uqCBv) \ pA qB- e 6} 

{{ {upAv, uqv) I pA — ^ qG 5} with x = e 

{ {upAv, uqBAv) \ pA qBA G 8) with x G S. 

Transitions are composed by defining as (— ^ o — ^) for all w G S*. 
The definition of runs also changes to reflect the fact that input words are 
no longer present on the tape but are read as the run progresses. The unique 
initial configuration is thus [qo]. A word w is accepted by M if [qo] Cf 
where c / is a final configuration. M is deterministic if, from any configuration, 
either all possible moves are labeled by distinct letters of S, or there is only 
one possible move labeled by e. Formally, for all configurations c, ci and C2 
such that c c\ and c C2, either x and y belong to S and if Ci ^ C2 
then xj^y,ovx = y = s and ci = C2. 

Labeled LBMs are as expressive as classical LBMs. 

Proposition 2 A language is (deterministic) context-sensitive if and only if 
it is accepted by a (deterministic) labeled linearly bounded Turing machine. 

Proof Let us first consider a context-sensitive language L accepted by a ter- 
minating LBM N (by Prop. 1). We sketch the construction of a LLBM 
M = {r, S,[,],Q,qo,F,d) accepting L. Let qA,qR and qs be three states 
in Q. In a configuration of the form [wqx] for w G S* and qx G {qA,qR}, 
M reads an input letter a E S and goes to the configuration [waqs]. Then it 
simulates N on wa using only e-transitions while remembering wa. Using the 
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same work alphabet as N, this would require the use of 2|wa| cells. However 
using an increased work alphabet (or a second work tape), this can be done 
using only |wa| cells. If A'' accepts (resp. rejects) wa, M restores wa on its 
(primary) work tape and steps in configuration [waq^^] (resp. [waqft]). By 
taking F = {qa} and qo = g^i if e belongs to L and qq ~ qu otherwise, it 
is easy to see that the set of words accepted by M from [qo] is precisely L. 
Note that non-deterministic behavior can only appear in A'' while simulating 
M. Hence, if M is deterministic then so is N. 

Conversely, let M be a LLBM accepting a language L C S*. We describe 
a LBM N accepting L with two tapes: the input tape and work tape. It is 
well known that this model is equivalent to LBMs with one tape as presented 
in Sec. 2.1 (see for example [13]). To each tape corresponds a set of control 
states: the set of work states Qp contains the set Q of states of M and the set 
of input states Qi; is reduced to {qi, g^} respectively the initial and accepting 
state. 

The machine N working on word w S S* starts with the configTiration 
([(7iw], [go])- From a configuration {[wiqiW2],[uqv]) with Wi,W2 G S* , q G 
Q and u,v € -T*, N can non-deterministically simulate any e-transition of 
N that can be applied to the configuration represented by the work tape. 
The deletion rules are simulated by shifting all tape content on the right of 
the read head by one cell to the left. Moreover N can non-deterministically 
simulate any a-transition for a € S provided that the input head is on top 
of the symbol a in which case it is moved one cell to the right. The insertion 
rules are simulated by shifting the work tape content to the right of the read 
head by one cell on the right. 

The machine M enters the accepting state qA if the input head is on top of 
the right border symbol ] and the work state is a final state of N. It follows 
from the construction of M that w is accepted by M if and only if it is 
accepted by N. Moreover, if N is deterministic then so is M. □ 

Remark 1 For convenience, one may consider LBMs whose initial configura- 
tion is not of the form [go] but is any fixed configuration cq. This does not 
add any expressive power, as can be proved by a simple encoding of cq into 
the control state set of the machine, which will not be detailed here. 

Remark 2 For simplicity, the above definition forces the insertion of a new 
tape cell each time a letter is read. More relaxed forms where a cell deletion 
or rewriting can occur during an input may be considered without any con- 
sequence for the results. Similarly, rules which do not move the read head 
can be allowed. 

Let M = {r,S,[,],Q,qo,F,S) be a LLBM, we define its configuration 
graph 

Cm = {(c,a,c')\c-^c' for a e i: U {e} |. 

M 

The vertices of this graph arc all configurations of M, and its edges de- 
note the transitions between them, including £-transitions. One may wish to 
only consider the behavior of M from an external point of view, i.e. only 
looking at the sequence of inputs. This means one has to find a way to con- 
ceal e-transitions without changing the accepted language or destroying the 
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structure. One speaks of the transition graph of an acceptor, as opposed to 
its configuration graph. 

In [25] , Stirling mentions a normal form for pushdown automata which al- 
lows him to consider a structural notion of transition graphs, without relying 
on the naming of vertices. Wc first recall this notion of normalized systems 
adapted to labeled LBMs. A labeled LBM is normalized if its set of control 
states can be partitioned in two subsets: one set of internal states, noted Qs, 
which can perform e-rules and only e-rules, and a set of external states noted 
Qs, which can only perform U-rules. More formally: 

Definition 5 A labeled LBM M = {r,S,[,],Q,qo,F,6) is normalized if 
there are disjoint sets Qs and Qe such that Q = Qe^Qs, F C Q^, and 

pB qAB eS ^ p&Qe, 

pA qB± &5 ox pA q&5 => p e Qe, 

peQe ^ for all A e r, 3pA qB± e 5. 

This definition implies in particular that a control state from which there 

exists no transition must belong to Qs- A configuration is external if its 
control state is in Qs, and internal otherwise. This makes it possible to 
structurally distinguish between internal vertices, which have one or more 
outgoing e-edgcs, and external ones which only have outgoing Z'-edges or 
have no outgoing edges. Given any labeled LBM, it is always possible to 
normalize it without changing the accepted language. 

Proposition 3 Every labeled linearly bounded machine can be normalized 
without changing the accepted language. 

Proof Let M be a LLBM, we know from Prop. 2 that the language L accepted 
by M is context-sensitive. Consider the LLBM M' accepting L obtained in 
the construction of the proof of Prop. 2. It is easy to see that M' is normalized: 
qA and qa are the only states that can perform transitions labeled by E, 
and they cannot perform e-transitions. All other states can only perform 
e-transitions. To verify the third condition in the definition of normalized 
LLBMs, it suffices to add instruction pA pA for every p and A for which 
the condition is violated. Moreover, the unique terminal state is qA, which is 
external. □ 

Remark 3 Normalization as previously described preserves determinism. 

From this point on, unless otherwise stated, we will only consider nor- 
malized LLBMs. We can now define the transition graph of a LLBM as the 
e-closure of its configuration graph, followed by a restriction to its set of 
external configurations (which happens to be a rational set). 

Definition 6 Let M = [r,E,[,],Q,qo,F,5) be a (normalized) LLBM, and 
Ce be its set of external configurations. The transition graph of M is 

Gm = {(c, a,c') I c, c' e Cs, a e S, Ac c' }. 
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Fig. 1 The transition graph of a labeled LBM accepting {(a"6")+ | n > 1}. 



We now define the class of linearly bounded graphs as the closure under 
isomorphism of transition graphs of labeled LBMs. 

Example 1 Figure 1 shows the transition graph of the normalized LLBM M 
whose rules are: 

9o] qoa] qia qib+ 926 qsa- qab qsa- 
qoa-^qoaa 9i] ^ 92]- 53[^9i[+ 
qoa qib+ 

and whose unique accepting state is 52 • This machine accepts the language 
{(a"6")+ \ n > 1}, which is also the language of paths of the graph between 
vertex [qo] and the set [b*q2b]. For the sake of clarity, only the part of the 
graph reachable from configuration [qo] is shown. We will see in Sect. 4 that 
this sub-graph is still a linearly bounded graph. 



3.2 Alternative definitions 

This section provides two alternative definitions of linearly bounded graphs. 
In [8] , it is shown that all previously mentioned classes of graphs can be ex- 
pressed in a uniform way in terms of Cayley-type graphs of certain classes 
of rewriting systems. We show that it is also the case for linearly bounded 
graphs, which arc the Caylcy-typc graphs of length-decreasing rewriting sys- 
tems. The second alternative definition we present changes the perspective 
and directly defines the edges of linearly bounded graphs using incremental 
context-sensitive transductions. This variety of definitions will allow us to 
prove in a simpler way some of the properties of linearly bounded graphs. 

3.2.1 Cayley-type graphs of decreasing rewriting systems 

We first give the relevant definitions about rewriting systems and Cayley-type 
graphs. A word rewriting system R over alphabet 7^ is a subset of F* x F*. 
Each element r) £ i? is called a rewriting rule and noted / r. The words 
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Fig. 2 Cayley-type graph of the rewriting system _R = {o — > 0, 6 — » 6, 06 — » 1, 
16 — »■ 60, c — > c, Ic —>■ e}, with S = {a, 6, c} and F = {o, 6, c, 0, 1}. Rules b ^ b 
and c ^ c ensure that no normal form of R contains b or c. As an example, edge 

01 10 is justified by the derivation 016 060 10. 



I and r arc respectively called the left- and right-hand side of the rule. The 
rewriting relation of R is the binary relation 

{{ulv,urv) \ u,v € r*, I ^ r € R} 

which we also denote by R, consisting of all pairs of words (wi, W2) such that 
W2 can be obtained by replacing (rewriting) an instance of a left-hand side I 
in wi with the corresponding right-hand side r. The reflexive and transitive 
closure R* of this relation is called the derivation of R. Whenever for some 
words u and v we have uR*v, we say R rewrites u into v. A word which 
contains no left-hand side is called a normal form. The set of all normal 
forms of R is written NF(i?). 

One can associate a unique infinite graph to any rewriting system by 
considering its Cayley-type graph defined as follows: 

Definition 7 The i7-labeled Cayley-type graph of a rewriting system R over 
r, with S C r, is the infinite graph 

Gr = {{u,a,v) \ aG S, u,v G NF(i?), uaR*v}. 

The class of rewriting systems we consider is that of finite length decreas- 
ing word rewriting systems, i.e. rewriting systems with a finite set of rules of 
the form I — > r with |^| > |r|, which can only preserve or decrease the length 
of the word to which they are applied. The reason for this choice is that the 
derivation relation of such a system coincides with arbitrary compositions of 
£-rules of a given labeled LBM. 

Example 2 Figure 2 shows the Cayley-type graph of a simple decreasing 
rewriting system. 
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3.2.2 Incremental context-sensitive transduction graphs 

The notion of computation graph was first systematically used in [8] (where 
the terminology used is relation graph). It corresponds to the graphs defined 
by the transductions (i.e. binary relations on words) associated to a class of 
finite machines. These works prove that for pushdown automata and Turing 
machines, the classes of transition and computation graphs coincide. We show 
that it is also the possible to give a definition of linearly bounded graphs as 
the computation graphs of a certain class of LBMs, or equivalently as the 
graphs defined by a certain class of context-sensitive transductions. 

A relation R is recognized by a LBM M if the language {u^v \ {u, v) € R} 
where # is a fresh symbol is accepted by M. However, this type of transduc- 
tions generates more than linearly bounded graphs. Even if we only consider 
linear relations (i.e relations R such that there exists c and k G N such that 
{u, v) € R implies < c • \u\ + k), we obtain graphs accepting the languages 
recognizable in exponential space (NSPACE[2"]) which strictly contain the 
context-sensitive languages (cf Thm. 2). We need to consider relations for 
which the length difference between a word and its image is bounded by a 
certain constant. Such relations can be associated to LBMs as follows: 

Definition 8 A k-increm,ental context-sensitive transduction T over F is de- 
fined by a LBM recognizing a language L C \u,v €: F* and \v\ < \u\ + 
k} where # does not belong to F. Relation T is defined as {{u, v) \ u#v G L}. 

The synchronized relations of finite image (i.e for u there are finitely many 
V such that {u,v) £ R) provide a first example of ^-incremental context- 
sensitive transductions. 

Proposition 4 For any synchronized relation R of finite image, there exists 
a constant fc G N such that R is a k-incremental context-sensitive transduc- 
tion. 

Proof Let R C F* x F* he & synchronized relation of finite image. It follows 
from the definition of synchronized relations that R is equal to a finite union: 



where for all fc S / U J, Sk is a length-preserving relation-^ and for alH G / 

and j G J, Ai and Bj are rational subsets of F* . 

As R has a finite image, for all j G J the set Bj is necessarily finite. Lot k be 



of words {u,v) e R, \v\ < Moreover the language {u^v \ {u,v) G R} is 

context-sensitive. Hence i? is a fc-incremental context-sensitive transduction. 

□ 

The following proposition states that incremental context-sensitive trans- 
ductions of a given level form a Boolean algebra. 

^ A synchronized relation R is length-preserving if V(a;,y) G R, \x\ = \y\ 
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Proposition 5 For all k -incremental context-sensitive transductions T and 

T over F* , T UT' , T DT' andT^Ek-T (where Ek is {{u, v) \ < \v\ < 
\u\ + k}) are incremental context-sensitive transductions. 

Proof The closure under union follows from that of context-sensitive lan- 
guages. The proof of closure under complement is a straightforward conse- 
quence of the closure under complement of context-sensitive languages (cf 
Thm. 1). Let T C F* x F* he a /c-incremental context-sensitive transduc- 
tion. By definition, the set L = {u^v \ {u, v) G T} is context-sensitive. It is 
straightforward to check that the set L' = {u#v \ {u, v) G Ek — T} is equal 
to: 

Ln{u#v I |u| < \v\ -I- k}. 

As the context-sensitive languages are closed under complement and inter- 
section, L' is context-sensitive. Hence, T is a A;-incremental context-sensitive 
transduction. □ 

The canonical graph associated to a finite set of transductions is called a 

transduction graph. Relating graphs to a class of binary relations on words 
was already used to define rational graphs and their sub-classes. 

Definition 9 The Z'-labclcd transduction graph of a finite set of incremental 
context-sensitive transductions {Ta)aei: is 

Gt = {{u, a,v) \ a £ S and (u, v) is recognized by Ta}. 

Example 3 The linearly bounded graph of Ex. 1 is isomorphic to the trans- 
duction graph of the following set of incremental context-sensitive transduc- 
tions: 

Ta = {(#a", #a"+i) I n > 0} U {(fe^a", 6'"-ia"+i) | m > 1, n > 0}, 
Tb = {{^a"", a^'-^h) I n > 1} U {(a"*?)", a'^-^fe^+i) | m > 1, n > 0}. 

The symbol 7^ is needed to distinguish a vertex directly reachable through 
a sequence of a's from a vertex reachable through a sequence of the form 
(a"6")*. 

As for the Cayley-type graph of Ex. 2, it can be seen as the transduction 
graph of the set of incremental context-sensitive transductions {Ta,Tb,Tc}, 
where Ta adds a and Tc removes a 1 to the right of a binary number, and 
T(, implements binary increment. 

Length-preserving context-sensitive transductions have already been ex- 
tensively studied in [18]. In the rest of this presentation, unless otherwise 
stated, we will only consider 1-incremental transductions without loss of 
generality regarding the obtained class of graphs: indeed, any fc-incremental 
transduction graph is isomorphic to a 1-incremental one over an increased 
alphabet. 
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3.2.3 Equivalence of all definitions 

We now prove that both classes of Cayley-type graphs of decreasing rewrit- 
ing systems, and incremental context-sensitive transduction graphs define 
precisely the class of linearly bounded graphs, up to isomorphism (i.e. up to 

vertex renaming). 

Theorem 4 For any graph G, the following statements are equivalent: 

1. G is isomorphic to the transitions graph of a labeled LBM, 

2. G is isomorphic to the Cayley-type graph of a finite length- decreasing 
system, 

3. G is isomorphic to a context-sensitive transduction graph. 

Proof 1 =^ 2: Let M = {F, S,[,],Q,qo, F,6) be a normalized labeled 
linearly bounded Turing machine, with S D F = (I). As M is normalized, 
its control states can be partitioned into Qs and (sec Definition 5). Let 
F' = FU {[,]}. We build a finite length-decreasing rewriting system R whose 
Cayley-type graph is the transition graph of M. Let the alphabet of R be 
Zi = i: U r' U (r' X g) U 5*, where S = {va, v'^, sa \ a € S} is a, new set of 
symbols disjoint from F and Q. Elements {x,q) of F' x Q will be noted Xq. 
For convenience, for any set X, X, will denote X L) {X x Qs), and x, any 
symbol in {x},. 

There are several important points which the rules of R must ensure: 

1 . Only words of the form (e U [, ) T* (e U ] , ) , i.e. words in which only external 
control states may occur and in which no symbol occurs to the left of a left 

bracket or to the right of a right bracket, should be normal forms (please 
be aware that F, denotes the set F U (F x Qs), and not F U {F x Q)): 

x[,^[,, ]»y ^]», s^s 

for all X e Z\, y€A\S, s e S U S U {F' x Q^). 

2. When a letter a in is added to the right of an irreducible word u, one 
should ensure that u actually represents a legal configuration^, i.e. that 
u is of one of the forms [qF*], [F* AqF*\ or [F*]q for A G T, g € Q: 

]a^Va\, ]qa^Va']q, AVa ^ VaA, AqVa ^ vj Aq, 

Avd Va A, {qVa ^ [Sa, {Va ^ {Sa- 

3. Finally, once it has been made sure that the word represents a legitimate 
LBM configuration, one should simulate an insertion operation followed 
by any number of e-transitions of the LBM: 

SaA ^ ASfx., SaBp ^ GqB 

for all a&S, A,B,C gF and pB qCB e 6, and 

ApC — > BCq, CAp — > GqB ApG — > Gq 

for all pA 9-B+, pA qB—, pA g G 5 (respectively). 

Note that this encoding of configuration differs from the one used previously, 
where control states were standalone symbols. 
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There is an edge u v in the Cayley-type graph Gr of R if and only 
if u and v arc words representing valid configurations of M from which no 
e-transition can be performed, i.e. observable configurations, and there exists 
a sequence of transitions labeled by as* of M by which u reaches v. There is 
a bijection between the edges and vertices of Gn and the transition graph of 
M, hence these two graphs are isomorphic. 

2 =^ 3: Let i? be a finite length-decreasing rewriting system, Gr its 
Cayley-type graph. For each letter a, we will show that the relation 

T„ = {{ua, v)\u-^v} = {{ua, v)\u,ve NF(iJ) A uaR*v} 

Gr 

is an incremental context-sensitive transduction by building a LBM Ma = 
{r,S,[,],Q,qo,F,S) recognizing Ta- 

For every pair {ua,v), Ma starts in configuration uai^v, and first has to 
check that u is a normal form of R by verifying that it contains no left- 
hand side of any rule in R. Second, Ma simulates the derivation of R on ua, 
applying one rewriting rule at a time until a normal form is reached. Due to 
non-determinism, there might be unsuccessful runs, but the pair is accepted if 
and only if one run reaches configuration v^v, meaning that R can normalize 
ua into v. Hence, a pair {ua, v) is in Ta if and only if {u, a, v) G Gr, meaning 
that the transduction graph of {Ta)aes is isomorphic to Gr. 

3 => 1: Let T = {Ta)aGi; be a finite set of incremental context- 
sensitive transductions defining a graph Gt, each Ta being recognized by 
an LBM Ma- We informally describe a normalized labeled LBM M whose 
transition graph Gm is isomorphic to Gt- 

Let q be the unique external control state of M, M should have a run 
labeled by ae* between configurations qu and qv whenever {u,v) G Ta, or 
cquivalently whenever the word u^ffv is accepted by Ma- This is done as 
follows. First, starting from configuration qu, M should perform an a-labeled 
transition, increasing its available tape space by 1, and step into an internal 
control state. It should then guess a word v, and write u=f/=v on its tape. 
Since Ta is incremental, this can be done using no more than |m| -|- 1 tape 
cells by writing two symbols in each cell. Then, M simulates the LBM Ma 
on input word u=ffv, while keeping an intact copy of v on the tape (this can 
again be done by a simple alphabet encoding). If the simulated run of Ma 
succeeds, M steps into external configuration qv by restoring the saved copy 
of V on the tape, otherwise it loops in a non-accepting internal state. By this 
construction, there is an edge {qu, a, qv) in Gm if and only if there is an edge 
(u, a, v) in Gt, hence both graphs are isomorphic. □ 

This shows that the three types of graphs presented in this section actually 
all define the same class, namely that of linearly bounded graphs. This variety 
of definitions will allow us to prove in a simpler way some of the properties 
of linearly bounded graphs. 

4 Structural properties 

Now that the class of linearly bounded graphs has been defined using three 
different formalisms, we can easily deduce some of their structural properties. 
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In particular, we look at the languages accepted by linearly bounded graphs 
and some of their closure properties. We also give some insight about the 
relation between linearly bounded graphs and deterministic context-sensitive 
languages, and conclude with a few logical properties. But first, we compare 
our notions to related work. 



4.1 Comparison with existing work 

We now give a precise comparison of linearly bounded graphs with the re- 
striction of Turing graphs ([6]) to the linearly bounded case, and with the 
configuration graphs considered in [16]. 

4.1.1 Configuration graphs 

In [16], the configuration graphs of a class of offline linearly bounded ma- 
chines very similar to our labeled LBM are considered. However, they include 
in their definition a restriction to the set of configurations reachable from the 
initial configuration. Therefore, their class of configuration graphs is incom- 
parable to ours. As we will see in Prop. 8, the linearly bounded graphs are 
closed under restriction to the set of vertices reachable from a given vertex. 
Hence, it is not necessary to impose this restriction directly in the definition 
of the configuration graph. 

Apart from this restriction, our class of configuration graphs coincides 
with the configuration graphs of [16] and with those of [6] in the case of 
linearly bounded Turing machines. 

4.1.2 Transition graphs 

Knapik and Payet do not consider transition graphs: instead of characterizing 
the £-closure of configuration graphs, they prove a closure property of this 
class up to weak bisimulation [19], which is not a structural characterization. 
Nevertheless, it is straightforward to prove that their results can be extended 
to the class of transition graphs considered up to isomorphism, as mentioned 
in Sect. 4. 

In [6] , Caucal defines the transition graphs of Turing machines from their 
configuration graphs using the very general notion of e-closure: wherever 
there is a path labeled by e*ae* in the configuration graph, there is an edge 
labeled by a in its e-closure, for every letter a. Furthermore, the definition 
allows the restriction to an arbitrary rational set of vertices (potentially the 
whole set of vertices). Figure 3 illustrates on a small configuration graph the 
difference between these two approaches. 

Our approach has two main advantages. First, the e-closure operation as 
defined by Caucal may give rise to spurious non-determinism in the obtained 
graphs. This additional complexity is artificial and is eliminated using our 
definition. Furthermore, our notion is purely structural, and does not rely 
on the naming of vertices. But interestingly, we can show that both classes 
still coincide up to isomorphism: for every labeled linearly bounded machine 
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a 




Fig. 3 (a) Normalized LLBM configuration graph, (b) Transition graph of (a) as 
defined in [6] (with no restriction on vertices), (c) Linearly bounded graph associ- 
ated to (a). 



M and rational set i?, one can define a machine M' such that the transition 
graph of M following Caucal's definition with respect to R is isomorphic to 
the linearly bounded graph corresponding to M' in our framework. 

Proposition 6 The class of linearly bounded graphs coincides, up to isomor- 
phism, with the restriction of Turing graphs to the (uniform) linearly bounded 
case. 

Proof Let M be a normalized LLBM, C its configuration graph, R its (ra- 
tional) set of external configurations and G its transition graph defined as 

G = {c ^ c' \ c,c' € R Ac ^ c' € C}. 
Consider the graph G' defined as the restriction to R of the £-closure of C. 

= {c — > c I c, c G RAc — > c G C}. 

The graphs G and G' are equal, since by definition of external vertices, they 
have no outgoing e-transitions. 

Conversely, let M be a (not necessarily normalized) LLBM, C its configu- 
ration graph, and R a rational set of configurations of M. The binary relation 

in C X C is a 1-incremental context-sensitive transduction. Hence the 
graph G defined as the restriction to R (see Prop. 8) of the £-closure of C is 
a linearly bounded graph. □ 



4.2 Languages 

It is quite obvious that the language of the transition graph of a LLBM 
M between the vertex representing its initial configuration and the set of 
vertices representing its final configurations is the language of M. In fact, 
the choice of initial and final vertices has no importance in terms of the class 

of languages one obtains. 

Proposition 7 The languages of linearly bounded graphs between an initial 
vertex i and a finite set F affinal vertices are the context-sensitive languages. 
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Proof Let G be a linearly bounded graph labeled by E defined by a family 
{Ta)aeE of 1-incrcmcntal context-sensitive transductions. Wc will prove that 
L{G, i, F) is context-sensitive even if F is a context-sensitive set. 

Let # ^ 17 be a new symbol, we consider the graph G obtained from G 
by adding a loop labeled by # on each vertex in F. Obviously, G is a linearly 

bounded graph as — >■= {(/,/) | / € F} is a 0-incrcmental context-sensitive 
transduction. By Thm. 4, G is the transition graph of a LLBM M. Let cq 
be the configuration of M corresponding to the vertex i in G. Consider the 
LLBM M' whose initial configuration is i (see Rem. 1) and whose set of final 
states are the states that appear in the left-hand side of a #-rule (without 
loss of generality, wc can assume that the #-rules do not depend on the 
current value of the cell). It is easy check that M' accepts L{G, i, F) and by 
Prop. 2, L{G,i,F) is context-sensitive. 

For the converse implication, let L be a context-sensitive language and 
M the normalized LLBM constructed in Prop. 2 accepting L. The transi- 
tion graph of M traces L from the initial configuration to an infinite set of 
configurations (i.e the configurations with state qa). Therefore, we need to 
adapt the construction of Prop. 2. We add a new state g/ to the machine 
M and whenever M makes an e-transition to enter state qa, wc add the 
ability to enter configuration [qf] by a sequence of e-transitions. As [qf] has 
no out-going edges, it is an external configuration and it is easy to see that 
L = L{G,Co,[qf]). a 

Remark 4 When a linearly boimded graph is explicitly seen as the transition 
graph of a LLBM, as a Cayley-type graph or as a transduction graph, i.e. 
when the naming of its vertices is fixed, considering context-sensitive sets of 
final vertices does not increase the accepted class of languages. 



4.3 Closure properties 

Linearly bounded graphs enjoy several good properties, which will be espe- 
cially important when comparing them to other classes of graphs related to 
LBMs or context-sensitive languages (see Section 5). 

Proposition 8 The class of linearly bounded graphs is closed under restric- 
tion to reachable vertices from any vertex and under restriction to a context- 
sensitive set of vertices. 

Proof We first prove the closure under restriction to a context-sensitive set 
of vertices. Let G be a linearly bounded graph defined by a family of {Ta)aei: 
of 1-incremental context-sensitive transduction. By Prop. 5, the transduction 
Tari{{u,v) \ u £ F,v € F and|t;| < |u|-t-l} is 1-incremental context-sensitive. 
It follows that G\f is a linearly bounded graph. 

Let us now prove the closure of linearly bounded graphs under restriction 
to reachable vertices. Let G be a linearly bounded graph given by a family 
of 1-incremental context-sensitive transductions {Ta)aei: ^■iid uq a vertex in 
Vg- Each transduction Ta for a G 17 is accepted by a LBM Ma- Consider the 
graph G' obtained by restricting G to its set of vertices reachable from uq. 
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Fig. 4 The graph G for some L containing hb but not h. 



Note that this set is in general not context-sensitive, as illustrated by the 
following example. 

Example 4 Consider a language L C {a, b}* that is recognizable in exponen- 
tial space but not in linear space (i.e. L e NSPACE[2"] \ NSPACE[n]). The 
existence of such a language is guaranteed by Thm. 2. Consider the graph G 
labeled by {a, b, ft} and defined by: 

— tlu — ^ jjux for all u G {a, b}* and x E {a, 6}, 

G 

— M"+^ for all u e {a, b}* and n + 1 < 2l"l, 

G 

— and tJutt^'"' u for all u € L. 

G 

The edge relations of this graph are l-incremental context-sensitive trans- 
ductions, hence G is linearly bounded. Figure 4 illustrates the construction 
of this graph. The set of vertices reachable from ft restricted to {a, b}* is 
precisely the language L which by definition is not context-sensitive. 

Therefore, we need to adopt a different naming of the vertices of G'. We 
construct a graph H isomorphic to G' which is defined by a family of 1- 

incremental context-sensitive transductions. For any pair of vertices u and 
u S Vg, we write u v if there exists a path from w to v in G using only 
vertices of size less than or equal to n. Note that since u and v are considered 
as part of any path between u and v, then necessarily n > max(|u|, \v\). The 
set of vertices of H ihVh = | u G r*,uo ^\u\+n ^ ^md uq 7^|„|_|_„_i u}. 

Note that for all u G Vc, there is a unique n such that wD" G Vh, and that 
ueVg\Vg' =^ ^ Vh for ah n. For all a G T, we take wD" vD"^ 

H 

iff uD" G Vh, vD"^ G Vh and u^v. 

G 

The graph H is isomorphic to G' . It is easy to see that the mapping 
p G Vqi I— > Vff associating to every u G Vg' the unique G Vjj is a 

bijection and a graph morphism from G' to H . We first prove that Vh is 

context-sensitive and that for all a E S, — ^ is a l-incremental context- 

H 

sensitive transduction. It will then follow that H and G' are linearly bounded. 
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1. Consider the language L+ equal to {-«□" | uq ^\u\+n u, n G N} and the 
language L~ equal to | uo =>|„|+„_i u, n € N}, it is easy to check 
that Vh = L'^ n L~ . If wc prove that L"*" and L~ are context-sensitive 
languages, it follows by Thm. 1 that Vr is a context-sensitive language. 
We construct a LBM M accepting L+. When starting with uD", M 
guesses a path from uq to u with vertices of length at most |u| + n. 
It starts with uq and guesses a word ui of length at most \u\ + n then 
simulates one of the M^'s to check that Uq Ui. Finally, M replaces 

G 

Uq by ui and iterates the process until Ui is equal to u. This can be done 
using at most 3(|w| -I- n) cells. A similar construction allows to recognize 
L-. 

2. For all a £ E, uD" vD"^ implies that |u| + m < |it| -1- n -|- 1, because 
Uq =>\u\+n u, u V and is 1-incremental. Therefore, is 1- 

G G M 

incremental. 

The language La = {uD^jji^n'" | u v} is accepted by a LBM that 

checks that uD" and vD^ belong to Vh (by simulating a machine accept- 
ing Vh) and that u v. Hence is a 1-incremental context-sensitive 

G H 

transduction. □ 

Example 5 This last construction applied to the graph of Ex. 4 gives the 

graph H defined by: 

- ttu — ^ ttwx for all u €E {a, &}* and .t G {a, b}, 

G 

- M" M""^^ for all u e {a, 6}* and n -f- 1 < 21"', 

G 

- and ttwf ^ un2'"'+^ for all u € X. 

Since all rational languages are context-sensitive, linearly bounded graphs 
are also closed under restriction to a rational set of vertices. This shows that 
it is not necessary to allow arbitrary rational restrictions in the definition of 
transition graphs of linearly bounded machines, since such a restriction can 
be directly applied to the set of external configurations of a machine. 

Finally, we can extend the result of [16] to the class of linearly bounded 
graphs, and show that they are closed under synchronized products [1]. The 
synchronized product of two graphs G and G" with labels in S and S' with 
respect to a set of constraints C C Z" x Z" is the graph 

G®G' = {{u, v) [u', v')\u^u' ^v ^v' ^ (a, 6) G C}. 

G G 

Proposition 9 The class of linearly bounded graphs is closed under synchro- 
nized products, up to isomorphism. 

Proof It is straightforward from this definition that if G and G' are both lin- 
early bounded, defined for instance as context-sensitive transduction graphs, 
the binary edge relations of their synchronized product are also incremental 
context-sensitive transductions. □ 
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Fig. 5 Configuration graph and deterministic transition graph of a non-deter- 
ministic labeled LBM 



4.4 Deterministic linearly bounded graphs 

We now consider the relations between the determinism of linearly bounded 
graphs and the determinism of the linearly bounded machines defining them. 
As a first remark, it is straightforward to notice that there exist non-deter- 
ministic labeled LBMs whose transition graphs are deterministic. More pre- 
cisely, any machine in which, from any given configuration, at most one 
external configuration is reachable by a partial run labeled by s*, has a de- 
terministic transition graph. Figure 5 illustrates this fact. 

In fact, we can show that any LLBM can be transformed (while preserving 
the accepted language) in order to verify this property. Consequently, as 
expressed by the following proposition, all context-sensitive languages can 
be accepted by a deterministic linearly bounded graph. 

Proposition 10 For every context-sensitive language L, there exists a de- 
terministic linearly bounded graph G, a vertex i and a rational set of vertices 
F of G such that L = L{G, {i},F). Moreover, G can be chosen to be a tree. 

Proof Let L C E* be a context-sensitive language, we build a linearly boun- 
ded graph G labeled by S whose vertices are words of the form Aw or Rw, 
with w € S* and A,R ^ S, and whose edges are defined by the following 
1-incremental context-sensitive transductions: 

Au Aua, Rv Ava for all u & L,v ^ L and ua, va G L, 
Au Rua, Rv Rva for all u G L,v ^ L and ua, va ^ L. 

The language of this deterministic graph between vertex X where X is equal 
to A if £ e L or equal to i? if £ ^ L and the rational set AF* is precisely L. 
Moreover, G is a tree isomorphic to the complete infinite U-Iabeled tree. □ 

Remark 5 The previous result does not stand for a finite set of final vertices 
even if we only consider deterministic context-sensitive languages. Consider 
for example, the language L = {{a"b)*\n e N}. Suppose that there exists a 
deterministic graph G such that L = L{G,i,F) for some finite set F, then 

there would exists m ^ n, such that i -^-^ / and i ^^—^ f. As f ^^—^ f for 

G G G 

some /' e F, it would follow that i ° f. 
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Fig. 6 Configuration graph and deterministic transition graph of a non-deter- 
ministic terminating labeled LBM 



Of course, wo cannot conclude from this that the languages of determinis- 
tic linearly bounded graphs are the deterministic context-sensitive languages, 
which would amount to answering the general question raised by Kuroda [17] 
whether deterministic and non-deterministic languages coincide. However, if 
wc only consider terminating linearly bounded machines (which have no in- 
finite run on any given input word) the class of transition graphs we obtain 
faithfully illustrates the determinism of the languages. 

Remark 6 Even for terminating LLBMs, the determinism of a transition 
graph does not imply that the machine defining it is deterministic. Figure 6 
illustrates this fact. The idea of the following proof is to show that any ter- 
minating LLBM whose transition graph is deterministic can be determinized 
without changing the structure of the graph. 

Proposition 11 The languages of deterministic transition graphs of termi- 
nating linearly hounded machines (between an initial vertex and a rational 
set of final vertices) are the deterministic context-sensitive languages. 

Proof Let L be a deterministic context-sensitive language. By Prop. 2, there 
exists a deterministic terminating LLBM M accepting L. By definition, its 
transition graph Gm is deterministic, and it accepts L between the vertex 
corresponding to the initial configuration and those corresponding to the final 
configurations of M . 

Conversely, let Gm be the deterministic transition graph of a terminating 
LLBM M = (r, r, [,], (5,0-0,^,5), we construct a deterministic LLBM N 
whose transition graph Gat is equal to Gm- The machine N is obtained from 
M by keeping exactly one rule in 5 with a given left-hand side and label. The 
machine N is equal to {F, E, [,],Q, qo, F, 6') where 6' is a subset of 6: 

— if pA qU G 5 then there exists pA q'U' G 5', 

- iipA^qUG S' then for all pA q'U' eS',q = q' and U = U' . 

By construction, N is deterministic. Moreover, the set of external configura- 
tions of N is equal to the set of external configurations of M. It remains to 
prove that for any two external configurations c and c', c c' if and only 

Gm 

if C > C . 
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If c c', there exists a path between c and c' labeled by ae* in the 

configuration graph Cm of M. As Gm is deterministic and terminating, any 

maximal path in Cm labeled by ae* ends in c'. This property still holds for 
the configuration graphs of Cn- Suppose by contradiction that there exists 
a maximal path in Cn labeled by ae* starting from c and ending in c" ^ c'. 
This implies the c" has out-going e-edges in Cm and not in Cjv which is 
impossible by construction of N. 

Conversely if c c' then, by construction of N, we have c c'. Hence, 

Giv GjM 

it follows that Gm = Gn- □ 

Remark 7 Other equivalent definitions of deterministic transition graphs of 

terminating labeled LBMs can be given: they coincide with the Caylcy-type 
graphs of length-decreasing rewriting systems with unique normal forms'^, 
and to the graphs of deterministic terminating incremental context-sensitive 
transductions. The intuitive reason behind this equivalence is that, in every 
such case, for each label the corresponding edge relation of the graph is a 
partial function from vertices to vertices. 

Remark 8 One can not use the previous constructions to determinize any 
LLBM. Indeed, the construction from Prop. 10 produces non terminating 
machines in general, and the construction from Prop. 1 does not preserve the 
structure of a machine's transition graph. 



4.5 Logical properties 

To conclude this section on properties of linearly bounded graphs, we in- 
vestigate the decidability of the first-order theory of configuration graphs of 
LLBMs and linearly bounded graphs. Due to the high expressive power of 
the model considered, only local properties expressed in first-order logic can 
be checked on LLBM configuration graphs. 

Proposition 12 Configuration graphs of linearly hounded Turing machines 

have a decidable first- order theory. 

Proof This is a direct consequence of the fact that configuration graphs of 
LLBMs are synchronized rational graphs, which have a decidable first-order 
theory [2,15]. □ 

However, as remarked by [16], there exists no algorithm which, given a 
LLBM M and a first-order sentence cj), decides whether the transition graph 
of M satisfies (j). This statement can be strengthened to the following propo- 
sition. 

Proposition 13 There exists a linearly bounded graph with an undecidable 
first- order theory. 



A rewriting system has unique normal forms if any word can be derived into at 
most one normal form. 
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Proof Consider a fixed enumeration (M„)„gN of all (unlabeled) Turing ma- 
chines, and a labeled Turing machine M whose language is the set of all 
words of the form #" such that machine M„ halts on the empty input. Us- 
ing only e-transitions, M guesses a number n and writes #" on its tape, then 

simulates machine number n on the empty input. If the machine halts, then 
M reads word and stops. All accepting runs of M are thus labeled by 
£*#" for some n. 

When replacing e by an observable symbol r, the configuration graph 
C of M is a linearly bounded graph. Let C be its restriction to vertices 
reachable from the initial configuration of Af, the graph C" is still linearly 
bounded by Prop. 8. For all n, the formula expressing the existence of a 
path labeled by r#" ending in a vertex with no successor is satisfied in C 
if and only if machine Af„ halts on input e. The set of all such satisfiable 
formulas is not recursive. Hence the first-order theory of C" is undecidable 
(not even recursively enumerable). □ 



5 Comparison with rational graphs 

We will now give some remarks about the comparison between linearly boun- 
ded graphs and several different sub-classes of rational graphs. First note 
that since linearly bounded graphs have by definition a finite degree, it is 
therefore only relevant to consider rational graphs of finite degree. However, 
even under this structural restriction, rational and linearly-bounded graphs 
are incomparable, due to the incompatibility in the growth rate of their 
vertices' degrees. 

A first observation is that in a rational graph the out-degree at distance n 
from any vertex can be c'^ for some constant c, whereas in a linearly bounded 
graph it is at most c". 

Lemma 1 For any linearly hounded graph G and any vertex x, there exists 
c e N such that the out-degree of G at distance n> of x is at most c". 

Proof Let {Ta)a&s be a set of incremental context-sensitive transductions 
describing G and fc^ G N such that Ta is a fca-incremental context-sensitive 
transduction. We take k to be the maximum of {ka \ a G S} L) {\x\}. At 
distance n > of a;, a vertex has length at most k{n + 1) and hence the out- 
degree is bounded by the number of vertices of length at most k{n + 2) which 
is less than |p|fc("+2)-i-i. Hence, there exists c G N such that the out-degree 
is bounded by c". □ 

Figure 7 shows a rational graph whose vertices at distance n from the 
root A have out-degree 2^"^\ This graph is thus not linearly bounded. 

Conversely, in a rational graph of finite degree, the in-degree at distance n 
from any vertex is at most c*^ for some c G N, in a linearly bounded graph it 
can be as large as f{n) for any mapping / from N to N recognizable in linear 
space (i.e. such that the language {0"l-^(") | n G N} is context-sensitive). 

Lemma 2 For any mapping / : N N recognizable in linear space, there 
exists a linearly bounded graph G with a vertex x such that the in-degree at 
distance n > of x is f{n). 



Linearly Bounded Infinite Graphs 



25 



AA. 




B/BB 



Fig. 7 A finite degree rational graph (together with its transducer) which is not 
isomorphic to any linearly bounded graph. 




0»i/(") 



Fig. 8 A linearly bounded graph which is not isomorphic to amy rational graph. 



Proof Let / be a mapping from N to N recognizable in linear space, and let 
G/ be the linearly bounded graph defined using the following incremental 
context-sensitive transduction: 

T = {{u,uQ) I u G 0*} U {{u,ul) I u e 0"1", m < f{n)} 
U {{uv,u) \ueO*,v el* and < f{\u\)}. 

Note that, since context-sensitive languages are closed under complement, it 
is not difficult to see that the set {0"I™ | m < /(n)} is context-sensitive for 
every / recognizable in linear space. Figure 8 illustrates the construction of 
Gf. The in-degree of vertex 0" at distance n from the root e is equal to /(n), 
for any mapping / recognizable in linear space. □ 

An instance of such a mapping is / : n i— > 2 , which is more than the 
in-degrcc at distance n of a vertex in any rational graph of finite degree. 
From these two observations, we easily get: 

Proposition 14 The classes of finite degree rational graphs and of linearly 
bounded graphs are incomparable. 

Since finite-degree rational graphs and linearly bounded graphs are in- 
comparable, we investigate more restricted sub-classes of rational graphs. 
For synchronized graphs of finite out-degree, we have the following result: 
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Proposition 15 The synchronized graphs of finite out-degree form a strict 
sub-class of linearly bounded graphs. 

Proof By Prop. 4, the synchronized relations of finite image arc fc-incre- 
mental. Hence, the synchronized graphs of finite out-degree are Unearly boun- 
ded graphs. The strictness can be deduced from the fact that synchronized 
graphs arc not closed under restriction to reachable vertices from a given 
vertex (cf Prop. 8), or that synchronized graphs have decidable first-order 
theories (cf Prop. 13). □ 

By restricting the out-degree even further, we can establish the following 
comparison: 

Theorem 5 The rational graphs of bounded out-degree form a sub- class of 
linearly bounded graphs (up to isomorphism). 

Proof The inclusion result is based on a uniformization result for rational 
relations of bounded image due to Weber. A relation is said to be A;- valued 
if for all w G Dom(i?), we have \{x \ {'w,x) € R}\ < k. In [28], it is proved 
that for any fc-valued rational relation R, there exist k rational functions^ 
Fi,...Fk such that R = Uie[i,fc] Pi- 

Let G = {Ta)a&E be a rational graph over F whose out-degree is bounded 
by k. Each relation Ta is therefore fc-valucd and hence there exist k rational 
functions Fa^, . . . Fa^ such that Ta = Uig[i k] P°-i- write X = {oi \ a e 

S BXidi e [i,fc]}. 

We will represent each vertex x of G by a pair (w, t) G Vq x X* such that 
X = Ft,^, (. . . {Ft^{w))). However, there can be several such pairs for a given 
vertex. Wc define a total order < on Vg x X* and associate to x the smallest 
such pair. First, let <r and <x be two arbitrary total orders over F and X 
respectively and let -< r and -<x be the lexicographic orders induced by <r 
and <x respectively. 

(m, i)<(n, r) iff |m| + |f| < |n| -I- |r| 

or |m| + \t\ = \n\ + \r\ and m -<r n 

or |m| + \t\ = \n\ + \r\, m = n and t -<x r 

It is straightforward to prove that < is a total order onVcxX* . We now prove 
that the function N associating to any pair {w, t) the smallest pair (m, r) such 
that Fi{w) ~ Fr{m) is a 0-incremental context-sensitive transduction. 

There are two important points in this proof. The first one is to be 
able to check in linear space that for any two given pairs (m,r) and {w,t), 
Fr{m) = Ft{w). In other terms, we want to prove that the language L = 
{m'ir$t^w I Fr{m) = Ft{w)} is context-sensitive. 

Let X be a finite alphabet disjoint from but in bijcction with X (for all 
x € X, we write x for the corresponding symbol m. X) , we consider the 
rational graph G defined by the family of transducers {Fa)aex U (Ja)aex 
where for all a e X, = F~^. It is easy to check that for all m,w G F* 



* A rational relation which maps exactly one image to each element of its domain 
is called a rational function. 
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and all r,t E X*. there exists a path m => w iff Fr{m) = Ft{w) where 

G _ 

t = t\t\...ti. By Cor. 1, the language {^M/ | w £ L{G,i, ,f)]r]r*^X* X^F* 
is a context-sensitive language. It follows that L is also context-sensitive. 

The second point is to show that the language L' = {m^rtf^w \ N{{w,t)) 
= (m, r)} is itself context-sensitive, which implies that N is indeed a context- 
sensitive transduction. Given {w, t) and (m, r), let (mi, ri), . . . , (rrife, rfe) be 
an enumeration of all pairs smaller than (m, r) with respect to <. A linearly 
bounded machine accepting L' successively tests for all i G [1,A;] whether 
mittri$tjjw ^ L, which is possible since L is context-sensitive and context- 
sensitive languages are closed under complement. If any of these k tests fails, 
then the whole computation fails. Otherwise, it only remains to check that 
mi'^ri%t^w e L. 

N is 0-incremental since by definition of the total order <, (m,r) < 
{w,t) => \m\ + \r\ < \w\ + \t\, and all steps of the construction of the 
machine accepting L' can be done in linear space. 

We can now define the linearly bounded graph G' with vertices in r*#X* 
using the set of transductions {Ta)aes with: 

Ta = \J {{wi^x, w'#y) I N{{w, x)) = {w, x) and N{w, xai) = {w' , y)}. 
ie[i,fc] 

As N is 0-incremental, Ta is 1-incremental. The mapping p from Vg to Vq x 
X* associating to a vertex x E Vg the smallest {w. t) such that x = Fi{u!) 
is a bijection from Vq to Vc, which induces an isomorphism from G to G' . 
Hence, G is a linearly bounded graph, which concludes the proof of inclusion. 

□ 

Example 6 Let P be a singleton alphabet, and consider the pair of transduc- 
tions Ta = {{n,2n)\n > 1} and Tb = {{11,71 — 3)\n > 4} on the domain of 
words over F seen as unary-coded positive integers. Let us apply the con- 
struction from the previous inclusion proof to the bounded-degree rational 
graph G defined by Ta and T^. Here, <r is trivial, and we take a <x b. 
The graph G and the linearly bounded graph G' obtained by applying the 
construction are shown on Fig. 9. Note for instance how vertex 13 in G is 
represented by vertex (2, a^b) in G' because (2, a^b) is the smallest pair {u, v) 
such that u 13. 

G 

In fact, we can prove that the previous inclusion is strict. 

Theorem 6 There exists a linearly bounded graph of bounded degree which 
is not a rational graph. 

Proof We now prove that there exists a linearly bounded graph of bounded 
degree which is not isomorphic to any bounded degree rational graph. Let 
us first establish that linearly bounded graphs of bounded degree are not 
closed under edge reversal. Let L C {0, 1}^ be a language in NSPACE[2"] \ 
NSPACE[n] (cf Thm. 2), we define a linearly bounded graph G with vertices 
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Fig. 9 Bounded- degree rational graph G and isomorphic lin. bounded graph G'. 



in 0{0, 1}*#* U 0{0, 1}* and edges defined by: 

= {{w,wx)\w e 0{0, 1}*} U {(w, lux) I w e 0{0, T}*} for x G {0, 1} 

^ = {(w, w#) \ w = uv,ue 0{0, 1}*, V e #* and \v\ < 2l"l } 

U {{w, u)\w = uv,ue 0{0, 1}*, ve#*,\v\ = 2l"l and u e L} 

The relations — and are incremental context-sensitive transduc- 
tions. In fact, as L belongs to NSPACE[2"], the language \ w € L} 
belongs to NSPACE[n]. The construction of Gl is illustrated by Figure 10. 

Suppose that linearly bounded graphs of bounded degree were closed 
under edge reversal. It would follow that H obtained from G by reversing 
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Fig. 10 The graph G for some L containing 11. 



the edges labeled by # is also a linearly bounded graph. As H is linearly 
bounded, we can assume that and are incremental context- 

H H H 

sensitive transductions. Let x be the vertex of H corresponding to 0. The 

set of vertices F = Dom( — >) is a context-sensitive language. It follows from 

u 

Proposition 7 that L(H, {x}, F) is context-sensitive. As L = L{H, {x}, F) n 
{0, 1}*, L would also be context-sensitive which contradicts its definition. 
As rational graphs are closed under edge reversal, it follows from Thm. 

5 that rational graphs of bounded degree are strictly contained in linearly 
bounded graphs of bounded degree. □ 

It may be interesting at this point to recall that there are strong reasons 
to believe that the languages accepted by finite degree synchronized graphs 
are strictly included in context-sensitive languages (see [3]). Furthermore, all 
existing proofs that the rational graphs accept the context-sensitive languages 
break down when the out-degree is bounded. It is not clear whether rational 
graphs of bounded degree accept all context-sensitive languages. However, as 
noted in Prop. 10, it is still the case for bounded degree linearly bounded 
graphs, and in particular for deterministic linearly bounded graphs. 

6 Conclusion 

This paper gives a natural definition of a class of canonical graphs associ- 
ated to the observable computations of labeled linearly bounded machines. It 
provides equivalent characterizations of this class as the Cayley-type graphs 
of length-decreasing term-rewriting systems, and as the graphs defined by a 
sub-class of context-sensitive transductions which can increase the length of 
their input by at most a constant number of letters. Although of a sensibly 
different nature from rational graphs, we showed that all rational graphs of 
bounded degree are linearly bounded graphs of bounded degree, and that 
this inclusion is strict. This leads us to consider a more restricted notion of 
infinite automata, closer to classical finite automata (as was already observed 
in [3]), and to propose a hierarchy of classes of infinite graphs of bounded 
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Fig. 11 A Chomsky- like hierarchy of bounded-degree infinite graphs. 



degree accepting the classes of languages of the Chomsky hierarchy, in the 
spirit of [8] (see Fig. 11). 

Finite graphs obviously have a bounded degree, they accept rational lan- 
guages. The transition graphs of real-time pushdown automata, which ac- 
cept all context-free languages, are the regular graphs [22], or equivalently 
the bounded degree HR graphs [10] and bounded degree prefix-recognizable 
graphs [7]. By Prop. 10, the languages of deterministic linearly bounded 
graphs are the context-sensitive languages. Deterministic graphs have by def- 
inition a bounded degree, so bounded degree linearly bounded graphs also 
accept the same class of languages. Finally, since deterministic Turing ma- 
chines have bounded-degree transition graphs and accept all recursively enu- 
merable languages, we can also restrict the class of Turing graphs to bounded 
degree. 
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